SMAP: a streamlined methylation analysis pipeline for bisulfite sequencing
Abstract
BACKGROUND: DNA methylation plays important roles in the regulation of gene expression and cellular specification. Reduced representation bisulfite sequencing (RRBS) has prevailed in methylation studies due to its cost-effectiveness and single-base resolution. The rapid accumulation of RRBS data demands well-designed analytical tools. FINDINGS: To streamline the processing of DNA methylation data from multiple RRBS samples, we present a flexible pipeline named SMAP, whose features include: (i) handling of diverse single- and/or paired-end bisulfite sequencing data with reduced false-positive rates in differentially methylated regions; (ii) detection of allele-specific methylation events with improved algorithms; (iii) a built-in pipeline for detection of novel single nucleotide polymorphisms (SNPs); (iv) support for multiple user-defined restriction enzymes; (v) execution of all methylation analyses in a single-step operation when properly configured. CONCLUSIONS: Simulation and experimental data validated the high accuracy of SMAP for SNP detection and methylation identification. Most analyses required in methylation studies (such as estimation of methylation levels, differentially methylated cytosine groups, and allele-specific methylation regions) can be executed readily with SMAP. All raw data from diverse samples can be processed in parallel and in 'packetized' streams. A simple user guide to the methylation applications is also provided.
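For readers unfamiliar with the "estimation of methylation levels" mentioned in the abstract, the following is a minimal sketch of the conventional per-cytosine estimate used in bisulfite sequencing: the fraction of reads that still show a C (methylated) among all reads showing C or T at that position. It is not taken from SMAP; the function names `methylation_level` and `region_level` are hypothetical, and SMAP's own filtering and statistics may differ.

```python
from typing import Iterable, Tuple


def methylation_level(methylated_reads: int, unmethylated_reads: int) -> float:
    """Fraction of reads supporting methylation at one cytosine.

    After bisulfite conversion, a read showing C counts as methylated and
    a read showing T counts as unmethylated (illustrative convention).
    """
    total = methylated_reads + unmethylated_reads
    if total == 0:
        raise ValueError("no read coverage at this cytosine")
    return methylated_reads / total


def region_level(sites: Iterable[Tuple[int, int]]) -> float:
    """Pooled methylation level over (methylated, unmethylated) count pairs."""
    sites = list(sites)
    methylated = sum(m for m, _ in sites)
    unmethylated = sum(u for _, u in sites)
    return methylation_level(methylated, unmethylated)


if __name__ == "__main__":
    # Hypothetical read counts for two CpG sites in one region.
    print(methylation_level(3, 1))
    print(region_level([(3, 1), (1, 3)]))
```

Pooling counts across sites, rather than averaging per-site fractions, weights each site by its coverage; either choice is an assumption of this sketch rather than something the abstract specifies.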
Similar resources
SAAP-RRBS: streamlined analysis and annotation pipeline for reduced representation bisulfite sequencing
Reduced representation bisulfite sequencing (RRBS) is a cost-effective approach for genome-wide methylation pattern profiling. Analyzing RRBS sequencing data is challenging and specialized alignment/mapping programs are needed. Although such programs have been developed, a comprehensive solution that provides researchers with good quality and analyzable data is still lacking. To addr...
Methy-Pipe: An Integrated Bioinformatics Pipeline for Whole Genome Bisulfite Sequencing Data Analysis
DNA methylation, one of the most important epigenetic modifications, plays a crucial role in various biological processes. The level of DNA methylation can be measured using whole-genome bisulfite sequencing at single base resolution. However, until now, there is a paucity of publicly available software for carrying out integrated methylation data analysis. In this study, we implemented Methy-P...
VM : a virtual machine for the integral analysis of bisulfite sequencing data
The analysis of whole genome DNA methylation patterns is an important first step towards understanding how DNA methylation is involved in the regulation of gene expression and genome stability. Previously, we published MethylExtract, a program for DNA methylation profiling and genotyping from the same sample. Over the last years we developed it further into a methylation analysis pipelin...
Computational Analysis of Genome-Wide ARGONAUTE-Dependent DNA Methylation in Plants.
Whole-genome bisulfite sequencing (WGBS) has become a powerful tool to dissect genome-wide methylation profiles at single-base resolution. In this chapter we describe in detail the bioinformatics pipeline used for the analysis of ARGONAUTE-dependent DNA methylation in Arabidopsis thaliana. We provide tools and command lines used for mapping bisulfite sequencing reads, for estimating methylation...
Detection of significantly differentially methylated regions in targeted bisulfite sequencing data
MOTIVATION: Bisulfite sequencing is currently the gold standard to obtain genome-wide DNA methylation profiles in eukaryotes. In contrast to the rapid development of appropriate pre-processing and alignment software, methods for analyzing the resulting methylation profiles are relatively limited so far. For instance, an appropriate pipeline to detect DNA methylation differences between cancer an...